Weakly Supervised U-Net with Limited Upsampling for Sound Event Detection
نویسندگان
چکیده
Sound event detection (SED) is the task of finding identities sound events, as well their onset and offset timings from audio recordings. When complete timing information not available in training data, but only are known, SED should be solved by weakly supervised learning. The conventional U-Net with global weighted rank pooling (GWRP) has shown a decent performance, extensive computation demanded. We propose novel limited upsampling (LUU-Net) threshold average (GTAP) to reduce model size, computational overhead. expansion along frequency axis decoder was minimized, so that output map sizes were reduced 40% at convolutional layers 12.5% fully connected without performance degradation. experimental results on mixed dataset DCASE 2018 Tasks 1 2 showed our GTAP about 23% faster achieved 0.644 tagging 0.531 tasks terms F1 scores, while GWRP 0.629 0.492, respectively. major contribution proposed LUU-Net reduction time being maintained or improved. other method, GTAP, further improved provides versatility for various mixing conditions adjusting single hyperparameter.
منابع مشابه
Weakly Supervised Action Detection
Detection of human action in videos has many applications such as video surveillance and content based video retrieval. Actions can be considered as spatio-temporal objects corresponding to spatio-temporal volumes in a video. The problem of action detection can thus be solved similarly to object detection in 2D images [3] where typically an object classifier is trained using positive and negati...
متن کاملTowards Accurate Event Detection in Social Media: A Weakly Supervised Approach for Learning Implicit Event Indicators
Accurate event detection in social media is very challenging because user generated contents are extremely noisy and sparse in content. Event indicators are generally words or phrases that act as a trigger that help us understand the semantics of the context they occur in. We present a weakly supervised approach that relies on using a single strong event indicator phrase as a seed to acquire a ...
متن کاملWeakly Supervised Object Detection with Posterior Regularization
Motivation: In weakly supervised object detection where only the presence or absence of an object category as a binary label is available for training, the common practice is to model the object location with latent variables and jointly learn them with the object appearance model [1, 5]. An ideal weakly supervised learning method for object detection is expected to guide the latent variables t...
متن کاملSentence Subjectivity Detection with Weakly-Supervised Learning
This paper presents a hierarchical Bayesian model based on latent Dirichlet allocation (LDA), called subjLDA, for sentence-level subjectivity detection, which automatically identifies whether a given sentence expresses opinion or states facts. In contrast to most of the existing methods relying on either labelled corpora for classifier training or linguistic pattern extraction for subjectivity ...
متن کاملCollaborative Learning for Weakly Supervised Object Detection
Weakly supervised object detection has recently received much attention, since it only requires imagelevel labels instead of the bounding-box labels consumed in strongly supervised learning. Nevertheless, the save in labeling expense is usually at the cost of model accuracy. In this paper, we propose a simple but effective weakly supervised collaborative learning framework to resolve this probl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13116822